PETER HEGARTY

Abstract. We propose a generalisation of the Cameron-Erdős conjecture for sum-free
sets to arbitrary non-translation invariant linear equations over $\mathbb{Z}$ in three or more
variables and, using well-known methods from graph theory, prove a weak form of
the conjecture for a class of equations for which the structure of the maximum-size sets
avoiding solutions to the equation has previously been obtained.



1. Introduction 

A set of integers $A$ is said to be sum-free if there are no solutions in $A$ to the equation
$x + y = z$. In 1988, Cameron and Erdős [CE] proposed their famous conjecture that the
number of sum-free subsets of $[1, n]$ is $O(2^{n/2})$ as $n \to \infty$. The number $n/2$ arises
naturally, since $\lceil n/2 \rceil$ is easily seen to be the maximum size of a sum-free subset of
$[1, n]$. Indeed, the set of odd numbers in the interval, together with the interval $(n/2, n]$,
provide two explicit examples of sum-free sets of maximum size. Moreover, it is not
hard to show that, for any odd $n$, these are the only two examples, whereas for $n$ even
there is one more, namely $[n/2, n)$. Since any subset of a sum-free set is still sum-free,
the main point of the Cameron-Erdős conjecture is that it says, in essence, that there are
few large sum-free sets.
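The statements above are easy to test by brute force for small $n$. The following Python sketch is an illustration only (the function names are ours and do not come from [CE]); it assumes, as is usual, that $x = y$ is allowed in a solution of $x + y = z$. It enumerates the sum-free subsets of $\{1, \ldots, n\}$ and returns their number, the maximum size and the list of maximum-size sets.

```python
from itertools import combinations


def is_sum_free(s):
    """Return True if no x, y, z in s (x = y allowed) satisfy x + y = z."""
    members = set(s)
    return not any(x + y in members for x in members for y in members)


def sum_free_subsets(n):
    """Enumerate all sum-free subsets of {1, ..., n} (exponential, small n only)."""
    universe = range(1, n + 1)
    found = []
    for size in range(n + 1):
        for subset in combinations(universe, size):
            if is_sum_free(subset):
                found.append(subset)
    return found


def summary(n):
    """Return (number of sum-free subsets, maximum size, list of maximum-size sets)."""
    subsets = sum_free_subsets(n)
    largest = max(len(s) for s in subsets)
    maximum_sets = [s for s in subsets if len(s) == largest]
    return len(subsets), largest, maximum_sets
```

For instance, `summary(7)` can be used to inspect the maximum sum-free subsets of $\{1, \ldots, 7\}$. The enumeration is exponential in $n$, so it is only practical for very small $n$.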

As is well known, this conjecture has been proven in recent years, independently, by
Green [G] and Sapozhenko [Sa]. Somewhat less well known is that many authors have
worked on the following generalisation of the problem. Let $k > l$ be positive integers.
A set $A$ of integers is said to be $(k, l)$-sum-free if there are no solutions in $A$ to the
equation $x_1 + \cdots + x_k = y_1 + \cdots + y_l$. For fixed $k$ and $l$, there are two natural examples
of a 'large' $(k, l)$-sum-free subset of $\{1, \ldots, n\}$, namely



$$A_1 = \left(\frac{l}{k}\,n,\ n\right] \qquad (1.1)$$

and

$$A_2 = \{m \in [1, n] : m \equiv r \pmod{p}\}, \qquad (1.2)$$

where $p$ is the smallest natural number not dividing $k - l$ and $r$ is any integer relatively
prime to $p$. Note that $|A_1| \sim \frac{k-l}{k}\,n$ and $|A_2| \sim \frac{1}{p}\,n$.

^In fact, they made a more precise conjecture about the asymptotic number of sum-free sets.

The simplest and most natural generalisation of the Cameron-Erdős conjecture to this
setting is then the assertion that the number of $(k, l)$-sum-free subsets of $[1, n]$ is
$O(2^{c_{k,l} n})$, where $c_{k,l} = \max\{\frac{k-l}{k}, \frac{1}{p}\}$.
There have been a considerable number of papers written on this problem, notably, in 
chronological order, llClil . flCTOl, [H, HScTH . [ISc2l and BO. The state-of-the-art is due 
to Lev 0, who established the conjecture (indeed, a more precise asymptotic) for all 
but finitely many pairs $(k, l)$.
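The two natural examples can be checked numerically. The sketch below is only an illustration under stated assumptions: it takes $A_1$ to be the interval $(ln/k, n]$, takes $r = 1$ by default in $A_2$, and allows repeated elements in a solution. All function names are hypothetical.

```python
def fold_sums(a, times):
    """All sums of `times` elements of a, repetitions allowed."""
    sums = {0}
    for _ in range(times):
        sums = {s + x for s in sums for x in a}
    return sums


def is_kl_sum_free(a, k, l):
    """True if no k elements of a have the same sum as some l elements of a."""
    return fold_sums(a, k).isdisjoint(fold_sums(a, l))


def smallest_non_divisor(m):
    """Smallest natural number that does not divide m (m >= 1)."""
    p = 2
    while m % p == 0:
        p += 1
    return p


def interval_example(n, k, l):
    """Assumed form of A_1: the integers m with l*n/k < m <= n."""
    return {m for m in range(1, n + 1) if k * m > l * n}


def congruence_example(n, k, l, r=1):
    """A_2: the integers in [1, n] congruent to r modulo p (r coprime to p)."""
    p = smallest_non_divisor(k - l)
    return {m for m in range(1, n + 1) if m % p == r % p}
```

For $(k, l) = (3, 1)$ and $n = 30$, for example, one has $p = 3$, and both `interval_example(30, 3, 1)` and `congruence_example(30, 3, 1)` can be passed to `is_kl_sum_free`.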

The original problem has also been generalised in a quite different direction, namely 
where the interval [1, n] is replaced by a finite abelian group (the notion of 'sum-free' 
being defined in the same way), in particular Z„. We will not be concerned with this 
direction here, but the interested reader is referred to HGRL 

Rather we take our inspiration from the previous generalisation. Let $\mathcal{L}$ denote an
arbitrary linear equation over $\mathbb{Z}$, which we may write as

$$\mathcal{L} : a_1 x_1 + \cdots + a_k x_k = b, \qquad a_1, \ldots, a_k, b \in \mathbb{Z}. \qquad (1.3)$$

A set $A$ of integers is said to be $\mathcal{L}$-free if there are no non-trivial solutions in $A$ to
(1.3). For an explanation of the meaning of 'non-trivial', see [R1]; trivial solutions
only arise when the equation is translation invariant, i.e. $\sum a_i = b = 0$. Let $M_{\mathcal{L}}(n)$
denote the maximum size of an $\mathcal{L}$-free subset of $[1, n]$ and $S_{\mathcal{L}}(n)$ the total number
of $\mathcal{L}$-free subsets. It is natural to ask if, for any $\mathcal{L}$, one has $S_{\mathcal{L}}(n) = O(2^{M_{\mathcal{L}}(n)})$?
However, at this level of generality, the answer is easily seen to be negative. For
suppose $\mathcal{L}$ is translation invariant. It follows from Szemerédi's celebrated theorem
[Sz] that $M_{\mathcal{L}}(n) = o(n)$, and from this it is easily deduced that $S_{\mathcal{L}}(n)/2^{M_{\mathcal{L}}(n)} \to \infty$.
Indeed, this observation was already made by Cameron and Erdős [CE]. For the specific
examples of Sidon sets and sets avoiding arithmetic progressions of a fixed length (the
latter referring to a system of linear equations rather than a single one), they asked if
one could still recover a weaker result, namely is

$$S_{\mathcal{L}}(n) = O(2^{(1+o(1)) M_{\mathcal{L}}(n)})\ ? \qquad (1.4)$$

As far as we know, absolutely no progress has been made on this question, for any
translation invariant $\mathcal{L}$. But this weaker conjecture is still interesting for historical
reasons, since the first significant advances on the Cameron-Erdős conjecture for sum-free
sets were made (independently) by Alon [A] and Calkin [C], who both proved (1.4)
in that case. They employed considerably different arguments. Alon's is based on an
estimate for the number of independent sets in a regular graph, and works equally well
in the group setting. Calkin makes use of Szemerédi's theorem. Both approaches had a
profound influence on future work. Sapozhenko's proof of the sum-free conjecture
crucially employs a version of Alon's graph theoretical result (Green's proof is different),
which is both an improvement and an extension to graphs which are 'nearly' regular:
see Lemma 2.4 below. Calkin's methods found greater purchase in the papers cited
above extending the analysis to $(k, l)$-sum-free sets.

Let us now state formally the conjecture we wish to propose in this paper : 



^In O, the term '$\mathcal{L}$-avoiding' was used.

^In a slightly different direction, Kahn [K1], [K2] proved an optimal result for bipartite graphs.
Conjecture 1.1. Let $\mathcal{L}$ be a linear equation over $\mathbb{Z}$ in at least three variables. For each
$n \in \mathbb{N}$ let $M_{\mathcal{L}}(n)$ and $S_{\mathcal{L}}(n)$ denote, respectively, the maximum size of and the total
number of $\mathcal{L}$-free subsets of $\{1, \ldots, n\}$. Then, as $n \to \infty$,

$$S_{\mathcal{L}}(n) = O_{\mathcal{L}}(2^{(1+o(1)) M_{\mathcal{L}}(n)}) \qquad (1.5)$$

and, if $\mathcal{L}$ is not translation invariant, then in fact

$$S_{\mathcal{L}}(n) = O_{\mathcal{L}}(2^{M_{\mathcal{L}}(n)}). \qquad (1.6)$$

We shall refer to (1.5) as the weak form of our conjecture and to (1.6) as the strong
form (for non-invariant equations).

Remark 1.2. The restriction to three or more variables is necessary, since even the
weak conjecture fails miserably in two variables. For example, consider the equation
$\mathcal{L} : 2x = y$. It is known (see the Proposition in Section 1 of 0) that a greedy choice
mechanism yields the infinite $\mathcal{L}$-free set

$$A = \{2^{2i} u : u \text{ odd},\ i \geq 0\} \qquad (1.7)$$

and that, for every $n$, $A_n := A \cap \{1, \ldots, n\}$ is an $\mathcal{L}$-free subset of $\{1, \ldots, n\}$ of maximum size.
Hence $M_{\mathcal{L}}(n)/n \to 2/3$. On the other hand, a set $B \subseteq \{1, \ldots, n\}$ is $\mathcal{L}$-free if and
only if $B$ is an independent set in the graph $G = (V, E)$ where $V = \{1, \ldots, n\}$ and
$\{x, y\} \in E$ if $2x = y$ or $2y = x$. This graph consists of $\lceil n/2 \rceil$ disjoint chains, one
for each odd number in $V$. In a chain of length $l$, the number of independent sets is
the Fibonacci number $F_{l+1}$; here $F_0 = F_1 = 1$ and $F_t = F_{t-1} + F_{t-2}$ for all $t > 1$.
To simplify the calculation of an estimate for $S_{\mathcal{L}}(n)$, suppose $n$ is a power of two, say
$n = 2^r$. Then the graph $G$ contains one chain of length $r + 1$ and, for each $j = 1, \ldots, r$,
$2^{j-1}$ chains of length $r - j$. It follows that

$$S_{\mathcal{L}}(2^r) = F_{r+2} \times \prod_{j=0}^{r-1} F_{r-j}^{2^j}. \qquad (1.8)$$

One easily checks that, for example, $F_t \geq \gamma^{t-1}$ for all $t$, where $\gamma = \frac{1+\sqrt{5}}{2}$. Thus
$S_{\mathcal{L}}(2^r) \geq \gamma^{a_r}$, where

$$a_r = (r + 1) + \sum_{j=0}^{r-1} 2^j (r - 1 - j) = 2^r. \qquad (1.9)$$

In other words, if $n$ is a power of two, then $S_{\mathcal{L}}(n) \geq \gamma^n$. Since $\gamma > 2^{2/3}$, this means
that (1.5) does not hold for $\mathcal{L}$.

It is quite easy to see that there is nothing special about this choice of $\mathcal{L}$ and that
(1.5) fails to hold for any non-invariant equation in two variables.
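The chain decomposition in Remark 1.2 can be checked directly for small $n$. The following Python sketch (an illustration with hypothetical function names, using the convention $F_0 = F_1 = 1$ of the remark) counts the subsets of $\{1, \ldots, n\}$ free of solutions to $2x = y$ in two ways, via the product of Fibonacci numbers over the chains and via brute force, and also computes the maximum size of such a set.

```python
def fib(t):
    """Fibonacci numbers with the convention F_0 = F_1 = 1."""
    a, b = 1, 1
    for _ in range(t):
        a, b = b, a + b
    return a


def chain_lengths(n):
    """Length of the chain m, 2m, 4m, ... inside {1, ..., n}, for each odd m."""
    lengths = []
    for m in range(1, n + 1, 2):
        length, v = 0, m
        while v <= n:
            length += 1
            v *= 2
        lengths.append(length)
    return lengths


def count_free_by_chains(n):
    """Number of subsets with no solution to 2x = y, as a product over chains."""
    total = 1
    for length in chain_lengths(n):
        total *= fib(length + 1)
    return total


def count_free_brute_force(n):
    """The same count by enumerating all 2^n subsets (small n only)."""
    count = 0
    for mask in range(1 << n):
        members = {i + 1 for i in range(n) if (mask >> i) & 1}
        if all(2 * x not in members for x in members):
            count += 1
    return count


def max_free_size(n):
    """A chain with t vertices has a largest independent set of size ceil(t / 2)."""
    return sum((length + 1) // 2 for length in chain_lengths(n))
```

Comparing `count_free_by_chains(n)` with `2 ** max_free_size(n)` for $n$ a power of two gives a concrete feel for the failure of (1.5) in this example.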

In considering Conjecture 1.1, a very big problem seems to present itself, namely, for
very few equations is the function $M_{\mathcal{L}}(n)$ even known. The structure of maximum
$\mathcal{L}$-free sets is, for most $\mathcal{L}$, shrouded in mystery. This problem was introduced in
very general terms by Ruzsa [R1], [R2], but remains intractable. In particular, Ruzsa
conjectured that maximum $\mathcal{L}$-free sets should always 'look like' (1.1) or (1.2), but
several examples are now known where this is not so. In this paper, we will consider
a collection of equations in three variables where the structure of the maximal $\mathcal{L}$-free
sets has been obtained in great detail [BHKLS], IIHl, and is usually more complicated
than (1.1) or (1.2). This background material is presented in Section 2 below. Our main
result (Theorem 2.2) is that (1.5) holds for these equations. While this may seem like a
minuscule step on the way to establishing (1.5) or (1.6) in general, we believe our
result motivates further study since we obtain it using the same graph theoretical tools
developed by Alon, Sapozhenko et al. It thus suggests that these methods may have
even wider applicability, at least to equations in three variables, to which they seem
particularly suited. Indeed, as the number of variables is increased, other techniques
may be more suitable, as in the papers cited above dealing with $(k, l)$-sum-free sets. We
shall return to this discussion in the last section.

^For the remainder of this paper, the word 'maximum' is used synonymously with 'maximum size',
and should not be confused with 'maximal', which denotes a weaker property having to do with location
in a poset of sets ordered by inclusion.

The rest of the paper is organised as follows. In Section 2, the main result is
introduced and some preliminary lemmas listed. In Section 3, we prove the main result in
a special case which illustrates all the main ideas involved and is the most technically
challenging. This section is the heart of the paper. In Sections 4 and 5, we prove
Theorem 2.2 in all remaining cases, giving only the briefest outline in the latter section.
Section 6 is devoted to summarising remarks and suggestions for future work.



2. Background material, statement of main result and preliminary lemmas

We will be concerned with the following three groups of equations : 



Group 1 : $x + y = cz$, $c > 2$.
Group 2 : $x + by = cz$, $b > 1$, $c >$ .
Group 3 : $b(x + y) = cz$, $b > 1$, $\mathrm{GCD}(b, c) = 1$.

Let us first recall what is known about the maximum $\mathcal{L}$-free subsets of $\{1, \ldots, n\}$, when
$\mathcal{L}$ is one of the equations in Groups 1-3. To help with notation, we shall denote the
equation $ax + by = cz$ by $\mathcal{L}_{a,b,c}$, and drop the subscripts whenever it is clear to which
equation we are referring. To begin with, it was proven in [CG] that $M_{\mathcal{L}_{1,1,3}}(n) = \lceil n/2 \rceil$
and that the odd numbers in $[1, n]$ form the unique maximum $\mathcal{L}$-free set for all $n \gg 0$.
In [BHKLS] it was shown that, for $c > 3$, $M_{\mathcal{L}_{1,1,c}}(n) = f(c) \cdot n + O(1)$, where

$$f(c) = \frac{(c-2)(c^4-4)}{c(c^4 - 2c^2 - 4)}, \qquad (2.1)$$



and that every maximum $\mathcal{L}$-free subset of $\{1, \ldots, n\}$ has a bounded symmetric
difference from the set



l,c 



n] 



-UcVcU, UcVcU 



c 



1 


u 


p 1 


U -UcU, UcU 


-n, n 









where 



$$u_c = \frac{2(c^2 - 2)}{c^4 - 2c^2 - 4}.$$



(2.2) 



(2.3) 
For equations in Group 2 it was proven in [El that $M_{\mathcal{L}_{1,b,c}}(n) = f(b, c) \cdot n + O(1)$,

where



f{b,c) 



1 



b + 1 



c2 - b{b + 1) 



and that every maximum ^-free set has bounded symmetric difference from 



^l,f),c(^) 



6 + 1 



where 



-Wb.cU, Wb.cU 



b+1 



b + 1 



1 - 1(1+ . 1 , (2.4) 

U I — ^n,n , (2.5) 

= c^-6(6+l) - ^'-^^ 
For Group 3 equations, maximum free sets were also exhibited in [|H1|, though not com- 
pletely classified. There are three different cases : 



(i) If c > 26 then M_^^ ^ ^{n) = (l — ^) n + 0(1) and a maximum ^-free set is 



/26 


U |x G 


26 




— n, n 


1, — n 


: X 


V c 




c 





: a; ^ (mod 6) 



(2.7) 



(ii) If $2 < c < 2b$ then $M_{\mathcal{L}_{b,b,c}}(n) = \left(1 - \frac{1}{b}\right) n + O(1)$ and a maximum $\mathcal{L}$-free set is

$$A_{b,b,c}(n) = \{x \in [1, n] : x \not\equiv 0 \pmod{b}\}. \qquad (2.8)$$

(iii) If c = 1 then M^^^^{n) = (l — ^) ^ and a maximum ^-free set is 

Remark 2.1. To use terminology introduced in [L], the sets in (2.2), (2.5) and (2.9) are
non-archimedean, those in (2.8) and for $\mathcal{L}_{1,1,3}$ are archimedean, while that in (2.7) is
mixed. We will return to this point in Section 6.
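That the congruence-class examples above are $\mathcal{L}$-free is a one-line divisibility argument, and it can also be confirmed numerically. The sketch below is an illustration with hypothetical function names; it treats every solution of $ax + by = cz$ as forbidden, including those with repeated elements, which is the relevant notion for non-invariant equations. It checks the odd numbers against $x + y = 3z$ and the set (2.8) against $b(x + y) = cz$.

```python
def is_free(candidates, a, b, c):
    """True if the set has no x, y, z (not necessarily distinct) with a*x + b*y = c*z."""
    members = set(candidates)
    targets = {c * z for z in members}
    return not any(a * x + b * y in targets for x in members for y in members)


def odd_numbers(n):
    """The odd numbers in [1, n]."""
    return {x for x in range(1, n + 1) if x % 2 == 1}


def non_multiples(n, b):
    """The set in (2.8): integers in [1, n] not divisible by b."""
    return {x for x in range(1, n + 1) if x % b != 0}
```

For example, `is_free(non_multiples(60, 3), 3, 3, 4)` tests the set (2.8) for the Group 3 equation $3(x + y) = 4z$, where $\mathrm{GCD}(3, 4) = 1$.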

Let us now state our main result. 

Theorem 2.2. Let $\mathcal{L}$ be any of the equations in Groups 1-3 above. Then, as $n \to \infty$,

$$S_{\mathcal{L}}(n) = O(2^{(1+o(1)) M_{\mathcal{L}}(n)}).$$

Our proof will make repeated use of the following two lemmas from the literature.
The first one is a very simple observation:

Lemma 2.3. For each $\varepsilon > 0$ there exists $\delta > 0$ such that, for all sufficiently large
integers $n$, the number of subsets of $\{1, \ldots, n\}$ of size at most $\delta n$ is at most $2^{\varepsilon n}$.

Proof. See Lemma 2 of [C]. □
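Lemma 2.3 can be made quantitative with the standard entropy bound $\sum_{j \leq \delta n} \binom{n}{j} \leq 2^{H(\delta) n}$ for $\delta \leq 1/2$, where $H$ is the binary entropy function. This bound is a well-known fact that we bring in for illustration; it is not necessarily the argument used in [C]. The following sketch, with hypothetical function names, compares the exact count with the entropy bound.

```python
from math import comb, log2


def log2_small_subsets(n, delta):
    """log2 of the number of subsets of {1, ..., n} of size at most delta * n."""
    max_size = int(delta * n)
    return log2(sum(comb(n, j) for j in range(max_size + 1)))


def binary_entropy(x):
    """Binary entropy H(x) = -x log2 x - (1 - x) log2 (1 - x), with H(0) = H(1) = 0."""
    if x <= 0.0 or x >= 1.0:
        return 0.0
    return -x * log2(x) - (1 - x) * log2(1 - x)
```

Since $H(\delta) \to 0$ as $\delta \to 0$, any $\delta$ with $H(\delta) \leq \varepsilon$ is an admissible choice in the lemma under this bound.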

The next result is the crucial one. Let $G = (V, E)$ be a simple, loopless graph.
Following [Sa], we say that $G$ is a $(p, l, k, m, \alpha, \beta, \theta)$-graph if it meets the following
conditions:

1. $|V| = p$.

2. Every vertex has degree at least $l$ and at most $m$.

3. The fraction of vertices with degree less than $k - \theta$ is at most $\alpha$, and the fraction
whose degree exceeds $k + \theta$ is at most $\beta$.
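To make the three conditions concrete, the following sketch computes, for a graph given as an adjacency dictionary and for chosen values of $k$ and $\theta$, the smallest parameters $p, l, m, \alpha, \beta$ for which the conditions hold. It is an illustration only: the function name and the dictionary representation are our own choices, and we read condition 3 with thresholds $k - \theta$ and $k + \theta$.

```python
def graph_parameters(adjacency, k, theta):
    """Tightest (p, l, m, alpha, beta) for a graph {vertex: set of neighbours}.

    alpha is the fraction of vertices of degree below k - theta,
    beta the fraction of vertices of degree above k + theta.
    """
    p = len(adjacency)
    degrees = [len(neighbours) for neighbours in adjacency.values()]
    l, m = min(degrees), max(degrees)
    alpha = sum(d < k - theta for d in degrees) / p
    beta = sum(d > k + theta for d in degrees) / p
    return {"p": p, "l": l, "k": k, "m": m,
            "alpha": alpha, "beta": beta, "theta": theta}
```

A $k$-regular graph gives $l = m = k$ and $\alpha = \beta = 0$ with $\theta = 0$, which is the situation of Alon's original estimate; the graphs arising later are regular except on a small fraction of their vertices.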



The following result is part of Theorem 2 of [Sa]:
Lemma 2.4. Let $G$ be a $(p, l, k, m, \alpha, \beta, \theta)$-graph and let $I(G)$ denote the number of
independent sets in $G$. Then, for large enough $p$, the bound (2.10) holds.



3. Proof of Theorem 2.2 for equations $\mathcal{L}_{1,1,c}$ with $c > 3$

The proof of the theorem in this case will illustrate all the key ideas involved, as well
as being the most technically challenging. We thus present it in detail. Our approach
is similar to that followed in [BHKLS], but since we are not interested in the detailed
structure of large $\mathcal{L}$-free sets the presentation will be modified to suit our present
purposes. For the rest of this section, $c$ denotes a fixed integer greater than 3 and $\mathcal{L}$ denotes
the equation $\mathcal{L}_{1,1,c}$.

We will seek to apply Lemma 2.4, and for that we will apply a strategy of 'pairing
off' elements in an $\mathcal{L}$-free set $A$ with elements not in $A$. The latter is accomplished by
means of the following simple fact:

Lemma 3.1. Let $A$ be an $\mathcal{L}$-free subset of $\{1, \ldots, n\}$ and suppose $x \in A \cap [1, \frac{2n}{c}]$. Then
for each integer $y \in [cx - n, n]$, at most one of the numbers $y$ and $cx - y$ lies in $A$.

Proof. Obvious. See page 2 of [BHKLS]. □

To obtain the structure of maximum $\mathcal{L}$-free sets, the basic idea is to show that, if an
$\mathcal{L}$-free set $A$ contains an element far from the three intervals identified in (2.2), then
Lemma 3.1 can be applied to pair off a lot of numbers, which keeps the size of $A$ small.
For Theorem 2.2, one essentially needs to extend this idea to show that, if an $\mathcal{L}$-free set
contains a lot of elements outside these 'good' intervals, then the resulting collection of
pairings will, when taken altogether, keep the number of possibilities for $A$ small. This
is where Lemma 2.4 comes in. Let's now get to the details. The first step is

Lemma 3.2. Let A be an ^-free subset o/ {1, n}. The number of possibilities for 

Proof. Let $\varepsilon > 0$ be given. Let $\delta > 0$ be as in Lemma 2.3, and $n$ be much, much
larger than is necessary for the conclusion of that lemma to hold (how much larger will
become clear in due course). Let $A$ be an $\mathcal{L}$-free subset of $\{1, \ldots, n\}$. We distinguish
two types of sets:

I. Those A for which \An{^,^] \ < 5(f), 
n. All other A 

If A is of type I, then clearly the number of possibilities for A f] (f , n] is 0{2^^~c + c^^). 
So it remains to consider type II sets. Let A be any such set. Divide the interval (f , ^] 
into subintervals each of length [v^J or \y/ri\ . There must exist at least one such 
subinterval, say = [s,t], such that ^ < ^ - (f)f and n A\ > ^^/n. First con- 
sider a fixed t. Note that there are certainly no more than possibilities for fl A, 
so we also consider a fixed choice of the latter. Let $G = (V, E)$ be the following graph:
$V = [cs - n, n]$ and $(x, y) \in E$ if there exists some $z \in \mathcal{J} \cap A$ such that $x + y = cz$.



$$I(G) \leq 2^{\frac{p}{2}\left(1 + \alpha(1 - l/k) + \beta(m/k - 1) + O\left(\theta/k + \sqrt{\log k / k}\right)\right)}. \qquad (2.10)$$
By Lemma 3.1, $A \cap [cs - n, n]$ must be an independent set in this graph. We now wish
to apply Lemma 2.4. The graph $G$ is a $(p, l, k, m, \alpha, \beta, \theta)$-graph with parameters

p — 1 + n — {cs — n) > -n, (3.1) 

I ^l,k ^m^\J^ nA\> -y/E, (3.2) 

/5 = ^ = 0, a = ^^^^ = 0,(l/x/^). (3.3) 
P 

To simplify notation, put cs — n :— Kn, where k = = 1 — ©g(l), so that p = 
1 + (1 — «;)n. From Lemma 2.4, it follows that the number of possibilities for 74n(fi;n, n\ 
is 0(2 2 (i+Oe(i/\Ai))^ Note that this estimate is not affected if we now allow the set 
J^n A to vary, still for a fixed t. Hence, for a fixed choice of t, the number of possibilities 
for a type II set equals the previous number times the number of possibilities for A n 
Ktn\. The latter factor is certainly no more than the number of possibilities for 
Ar\{!^,Ktn]. 

To summarise, we have shown that, for given e > and all n sufficiently large (de- 
pending on e), if A is an ^-free subset of {1, n} then the number 7r„ of possibilities 
for ^ n n] satisfies a recursion 

7r„ = o(2[(^-t)+i]"+ 2^(^+«^(VV^))7r.J , (3.4) 

\ \/c<k<kP J 

where $\kappa^0 = 1 - \Theta_\varepsilon(1)$ and the number of terms in the sum is $O(n) = 2^{O(\log n)}$. Clearly,
we can deduce Lemma 3.2 by iterating this recursion $O_\varepsilon(1)$ times for fixed $\varepsilon$ and $n$, and
then letting $\varepsilon \to 0$ as $n \to \infty$. □
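The graph used in the proof above is easy to build explicitly, which helps in seeing why it is almost regular. The sketch below is an illustration only, with hypothetical names: `marked` plays the role of the chosen elements of $A$ in the subinterval $[s, t]$, the vertex set is taken to be $[cs - n, n]$, and loops are discarded so that the graph is simple. Each marked $z$ joins $x$ to $cz - x$ whenever both lie in the vertex set, so every vertex $x \geq ct - n$ has one partner per marked element, apart from a possible discarded loop.

```python
def pairing_graph(n, c, s, marked):
    """Graph on [c*s - n, n] with x ~ y whenever x + y = c*z for some z in marked.

    Assumes every z in marked satisfies z >= s and that 1 <= c*s - n <= n.
    """
    low = c * s - n
    adjacency = {v: set() for v in range(low, n + 1)}
    for z in marked:
        for x in range(low, n + 1):
            y = c * z - x
            if low <= y <= n and y != x:  # skip partners outside V and loops
                adjacency[x].add(y)
    return adjacency


def is_independent(adjacency, subset):
    """True if no two elements of subset are adjacent."""
    chosen = set(subset)
    return all(adjacency[v].isdisjoint(chosen) for v in chosen)
```

By Lemma 3.1, the part of an $\mathcal{L}$-free set lying in the vertex set must pass `is_independent` whenever the marked elements belong to that set.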

Lemma 3.3. Let A be an £^-free subset of {\, ...,n}. The number of possibilities for 

^ n [-^y n] is 2Mc)Mi+oii))^ ^^^^^ j^(^) = (1 _ + _^). 

Proof. Let e > be given, 5 as in Lemma 2.3 and n sufficiently large. Let t„ denote the 

where A is .if -free. This time, we consider 



number of possibilities for A n ^ 
three types of ^-free sets 



4n 

c(c2-2)' 



L Those A for which \A n f ] | < 5 ) and \A n f ]| < 5( i^^lE^), 
XL Those A for which |A n , f ]| < 5 (^), but \A n f ]| > 
m. Those A for which | A n ^] | > 5 (^) . 



c(c2-2 



First, regarding sets of type III, we can argue as in the proof of Lemma 3.2 and
deduce that their number is

/ \ 

O Y 2^(i+^^(V0i)r«„ , (3.5) 
for some constant $\kappa^0 = 1 - \Theta_\varepsilon(1)$.



Secondly, regarding sets A of type I, the number of possibilites for A n {-^z^^n] is 
by assumption at most 2'^^^^~c)+o{e)] ^ gy Lemma 3.2, we also know that the number 
of possibilites for A n {^^^, ^] is 0(2^^"^^^^^''^^+°^^^^). Hence, the number of 
possibilities for A n {^^^^ n], when A is of type I, is clearly o(2-^i(^)-"(^+^(^))). 

To deal with type II sets will require a further application of Lemma 2.4. Let A be 
any such set and divide the interval (^J^, f ] into subintervals each of length [v^J or 
[^n]. Let = [s,t] be the rightmost subinterval such that D A\ > \\fn. We 
must have t > ^ + \^^^^E^\ Fix t and set s := ^ + si, t := ^ + ti and 
T :— min{t, ^532 + "^O- To simplify notation, assume that n is a multiple of (? — 2. 
For a fixed choice of n ^4, let G — (X, Y, E') be the following bipartite graph : 
X = [1, T], F = [ct - T, ct) and (x, y) ^ E \i there is some z ^ J r\ A such that 
X ^ y = cz. Notice that Y C (^,n], so in particular X HY = (p. By Lemma 3.1, 
An{X UY) must be an independent set in this graph. The graph G is easily seen to be 
a {p, I, k, m, a, /3, ^^)-graph with p = 2T and the remaining parameters as in (3.2) and 
(3.3). From Lemma 2.4 it follows that the number of possibilities for Ad {X \JY) is 
Q(2^(i+Oe(i/VH))-)_ Now we have 

A= {AnixuY))u{An (T, t])u(^An (t, u(^An ((^, n]\r)^ . (3.6) 

We have just given an upper bound for the number of possibilities for the first intersec- 
tion. Trivial bounds for the second and fourth are 0(2*^^) and 0(2'^^^")""^) respec- 
tively. Our assumptions also imply that there are no more than 2''*^^^*) possibilities for 
the third intersection. Since t — T < ^33^, it follows by an easy computation that the 
number of possibilites for a type II set is 0(2^^^'^^-"'(^+o('^))). All of this was for a fixed 
choice of t and n A, but allowing these to vary will not affect our estimate. 

To summarise, we have shown that, given e > 0, then for all sufficiently large n, 
I \ 

1 —4—<k<k" I 

\ c(c^-2) / 

for some = 1 — 0e(l). Iterating this recursion Oe(l) times for fixed e and then letting 
e — > as n — >^ 00 clearly leads to a proof of the lemma. □ 

Before we start with the proof of the theorem proper, let us note one further fact, 
whose proof is a straightforward iteration of the argument used in the proof of Lemma 
3.2: 

Lemma 3.4. Given $\varepsilon_1, \varepsilon_2 > 0$ there exists $\delta > 0$ such that, for all sufficiently large $n$,
we can distinguish two types of $\mathcal{L}$-free subsets $A$ of $\{1, \ldots, n\}$.

Type 1 .• Those Aforwhich there is some y G (ein, n] such that |An(^, ^]| < (^). 
For each such y, there are 0(2^*^^+*^*^'^2)) possibilities for A n [y, n]. 



(3.7) 



INDEPENDENT SETS IN ALMOST-REGULAR GRAPHS AND THE CAMERON-ERDOS PROBLEM FOR NON-INVARIANT LINEAl 



Type 2 All other S^-free sets A. In this case, there are 0(2'' 2'^"(i+o{e2))) ^^/^^^ 
possibilites for A n (ein, n]. 

We are now ready to prove Theorem 2.2 for the equation =Sfi.i Let n be a large 
positive integer and let ^„ denote the number of ^-free subsets of {1, .... n}. Let e be a 
small positive real number (in the end we'll send e — > 0), let 5 be as in Lemma 2.3 and 
divide the J2f-free sets into the same three types as in the proof of Lemma 3.3. By the 
same arguments as used in the proof of that lemma, one finds that the number of type 
m sets is 0(Eo<K<^o 2^^(i+°^(VV^)^^^) for some k° = k° < 1, and the number 

of type n sets is 0{2^ =^-2 '^^ ^^"). We want to set up a recursion which will yield 
the theorem and thus it remains to bound the number of type I sets. Let ^7 „ denote the 
number of type I sets. We will show that, for any fixed constant cf) satisfying 

4 8 
c2(c4-2c2-2) ^"^^ c^ic^-AY ^^'^^ 

we have 

= O (2/(^)-(^+°(^)) + 2(^-?)"(^+«(^))e(^+,)„) . (3.9) 
Notice that, since < 32(^34) > one has 

f{c)(^ + A + (l--) <f{c). (3.10) 



c 



It is then clear that (3.9), together with the estimates for the numbers of sets of types II
and III, suffice to set up a recursion which will yield the theorem.

So for the remainder of the proof, we fix a choice of $\phi$ satisfying (3.8), and our aim
is to prove (3.9). Let $A$ be an $\mathcal{L}$-free set of type I. We consider two cases:

Case 1 : There is at least one t e (n(^+0),^|^] such that the interval ^ = [t-^,t] 
contains at least 6\/n elements of A. 
Case 2 : There is no such t. 



In Case 2 we can immediately deduce that 

< (l + o(l))(5n0-|-0^ , (3.11) 

and from there an upper bound on the number of possibilities for A corresponding to 
the second term on the right hand side of (3.9). 

So to Case 1. Our aim is to show that the number of possibilities for A in this case is 

Q^2m-na+o{e))y poj. each t G (n + 0) , ^] , let be the family of all type-I 
=Sf -free sets A such that t is the largest integer for which the property in CASE 1 holds. 
Since n = 2°^'^^ it suffices to show that for each individual t one has 

= 0(2^(">"(^+°(^»), (3.12) 

for we can then sum over all t and it will not affect our estimate. 



/ /2 A 2n" 
So we consider a fixed t. Let t = + (pt) and ^ = J't = [s, t\. We also consider
a fixed choice of V\ A, as this will not affect our estimate either. Our argument will 
now follow a familiar pattern. Put s — ^ ^ s\ and t — ^ + ti. Let G — (X, y, E) be 
the bipartite graph where X = [1, cii], y = ^ + and (x, y) e £■ if x + y = 
for some z e fi A. Note that < -^^z^, so X and Y are certainly disjoint. We 
have 



A = (A n (X u F)) u (A n (cti, t]) u n (t, u n ((^, n]\F)^ . (3.13) 

As in the proof of Lemma 3.3, we can bound the number of possibilities for the first 
intersection by 0(2^*i(i+^'(^/^))). gy definition of t, there are 0(2^(^)") possibili- 
ties for the third intersection. Trivially, there are 0(2(1- c)"-^*i) possibilities for the 
fourth intersection. Unlike in Lemma 3.3 however, we will need a non-trivial bound 
on the number of options for the second intersection. It is now that we use the left- 
hand inequality of (3.8). It is easily checked to imply that ct\ > ^^Jz^ • Hence, since 

t — Q{n), we can use Lemma 3.3 to conclude that there are 0(2-^i('^) *(^+°(i^)) possibil- 
ities for the second intersection in (3.13). Another easy computation shows that this is 
already enough to yield (3.12) if and only if (pt < e2(c4-2c2-4) ' other words, if and 

only if t < i-^z2^^)n. So it remains to prove (3.12) when t > ( Jj^^^^^ )n. Note that, 
in that case, cti > . f * s . 

At this point, we use Lemma 3.4. Since ti = Q{t) it implies that either there are 
C>(22(i+o{e))) possibilities for A n (cti, t], which is even better than what Lemma 3.3 
yields, or there must be some y e {cti,t] satisfying the requirements of Lemma 3.4. As 
usual, we may consider a fixed y, since summing over y at the end will not affect our 
estimates. Trivially, we'll still have only 0(2(i-c)*(i+o(e))) possibilities for A n (cti, t], 
which remains better than what Lemma 3.3 gives, unless y/c—cti = 6(n) and positive, 
and \A n (cti, ^]| > — cti). Thus, in turn, we may assume that there is a largest 
integer z e {cti + ^/n, ^] such that A contains at least 6^/n elements from the interval 
J^z = [z — \/n, z] . Again, we may consider a fixed z, since summing over different z 
will not affect an estimate like (3.12). To be precise, let '-^z,y,t be the family of ^-free 
sets we are now considering for fixed z, y, t satisfying all our assumptions to date. It 
just remains to prove that 

\^,,y,t\ = 0(2^(^)-"(i+°(^))). (3.14) 

To simplify notation later on, let cti :— qt, where q > ^f^J_2) ■ Regarding the intersec- 
tion A n (cti, t]^ for fixed z, y we can still trivially bound the number of possibilities 
by 0(2'^i-~''*+*^^-''*^+*^'^^^*) and, together with the estimates for the other terms in (3.13), 
this is easily checked to already yield (3.14) unless z > 1/(^332 + '^)' where 

— — ^ 



q- , , . J • (3.15) 



- 2c2 - 4 V c(c2 - 2) 

Note, in particular, that r > 0. So we can now assume furthermore that z satisfies this 
condition. From this point onwards, we work with a different decomposition of a set 
A e '^z,y,t than that considered in (3.13), namely

A^{An[l,y])U{Ar}{y,t])U{An {t, n]) . (3.16) 

We already know there are 0(2(^~c)(*~2')(i+C'(«))) possibilities for the second intersec- 
tion and 0(2*^^^")"*^^+'^'^^))) possibilities for the third. We deal with the first intersection 
by one final application of graph theory. Fix a choice of fl A which, as usual, doesn't 
affect anything. Put z = + Zi and Z = mm{z, -^^h^) + czi). Let G = ^, S') 
be the bipartite graph where ^ = [1, Z], '3^ = [cz — Z, cz) and {x,y) & S ifx + y = cC, 
for some C G ^2 fi A. Lemma 2.4 can be applied in the usual manner to conclude that 
there are 0(2^(^+'^'(^/v^))) possibilities for A n U ^). We now consider a decom- 
position of ^4 n [1, y] analogous to (3.6), namely 

An [1, y] = (A n u ^)) u (A n [z, z\) u(^An {z, u (^A n (^(^, y]\^^ ^ . 

(3.17) 

We have just given an upper bound for the number of possibilities for the first inter- 
section. Trivial bounds for the second and fourth are 0(2^~^) and 0(2'^^~c)s/-^) j-g. 
spectively, while our definition of z means there are 0(2*^'^^)^) possibilities for the third. 
Plugging these estimates into (3.17), then the resulting estimate for the number of op- 
tions for An [1, y] back into (3.16), and finally a straightforward computation shows 
that (3.14) holds provided the variable r satisfies 

'■>'i+''W) (,-ig::2k4) («-^)- 

This is consistent with (3.15) for all n ^ 0, hence the proof of Theorem 2.2 for the 
equation ^i^c complete. 

4. Proof of Theorem 2.2 for equations $\mathcal{L}_{1,b,c}$

Let integers 6, c be given with h > 1 and c > ^^^z^- The argument that follows is 
similar to that in Section 3, in fact shorter but with some new technicalities. The latter
arise because we don't have as simple a pairing mechanism as in Lemma 3.1 since, if
$z \in A$, then there exists an integer solution to $x + by = cz$ if and only if $x \equiv cz \pmod{b}$.

We start with a lemma analogous to Lemma 3.2 : 

Lemma 4.1. Let A be an ££-free subset 0/ {1, n). The number of possibilities for 
A n (n/c, n] is 0(2(i-^)'*(i+<'«)). 

Proof. Let $\varepsilon > 0$ be given. Let $\delta > 0$ be as in Lemma 2.3 and $n$ be very large. Let $A$ be
an $\mathcal{L}$-free subset of $\{1, \ldots, n\}$. We distinguish two types of sets:

L Those A for which |An(^,^^^] I < 
n. All other A 

If A is of type I, then clearly there are o(2(i-^)"(i+o{^))) possibilities for An(^, ^^^]. 
Now consider type n sets. Divide the interval I^i^] into subintervals, each of 
length [i/nJ o^" \\P^^ ^^^^h that [6n/cJ is a right-hand endpoint of some such interval. 
At least one of the following two possibilities must arise : 
(i) There is some such interval J = [s, t] such that H A\ > and C ^].

(ii) There is some such interval ^ = [s, t] such that ^ C i^^], t < - (|)^ 
and n A| > |v^. 

First suppose (i) occurs. There is at least one residue class x (mod b) such that y O A 
contains at least i\/n elements congruent to x (mod 6). Let denote the correspond- 
ing set of elements of y f] A. For a fixed choice of the parameters t, x and y^, let 
G — {X, Y, E) be the following bipartite graph : X — {x & \i,n] : x = x (mod 6)}, 
Y = [^^, and (x, y) e if there exists some z e J^^ such that x -\-hy ^ cz. 
The graph G is a (p, Z, /c, m, a, /3, ^)-graph where 

P=y + a,6(A/^), (4.1) 

; = 0, A; = m = I J^^^l > -Vri, (4.2) 

/3 = ^ = 0,o; = O,,6(l/V^). (4.3) 

Now A n (X U y) must correspond to an independent set in this graph so, by Lemma 
2.4, the number of possibilities for this intersection, for a fixed b, is 0(2?(^+'^^(^/^))). 
We now use the trivial decomposition 

A^[An{XUY)]U[Ar] ([1, n]\{X U Y))] . (4.4) 
Note that X and Fare not disjoint, but that |Xuy I = (| - ^)n + 0{y^). Hence the 
number of possibilities for the second intersection in (4.4) is trivially 0(2'-^" M^+o{i/Vn))^^ 
and hence the total number of possibilities for A is 0(2^^"^^"^^+'^^^^/^^)). Since 
c > ^^tz^, this number is also 0(2(i-^)'^(^+^^(i/^))). Allowing t, x and y^ to vary 
will not affect this estimate. 



Now we turn to Case (ii). As before, choose a congruence class x (mod 6) such that 

same three parameters as before fixed, let the real number 
K, be defined by setting cs := (b + l)n — (1 — K)n. Note that < k < 1 and 
K = 1—95(1) = 1 — 0e(l). This time the bipartite graph G = {X,Y,E) we wish to con- 
sider is the following : X — {x & [ku, n\ : x = x (mod 6)}, Y — [{1 — ^-^)n, n] and 
{x,y) e E if x+by = c^; for some ^ e y^^. By Lemma 2.4, there are 0(2f^(^+*^^(^/^))) 
possibilities for AO {X UY). We decompose a generic A as 

A^(Ar] [1, Ku]) U{Ar](XU Y)) U{Ar] (ku, n]\{X u y)) (4.5) 

and deduce by a similar argument as in Case (i) that the number of possibilities for 
A n (n/c, n] is now 0(2(^-^)(i-'")"(i+'^^(^/v^))7r«„). Allowing x and y^ to vary does 
not affect our estimate (note that k depends only on t). 

To summarise, what we have demonstrated is a recursion for tt^ of the form 

\ o<K<i-ee(i) / 
where the number of terms in the sum is $O(n) = 2^{O(\log n)}$. By the usual reasoning, this is
enough to establish our lemma. □

We do not need an analogue of Lemma 3.3 this time round (since the extremal $\mathcal{L}$-free
sets consist of two intervals only), so can go straight ahead to the proof of the theorem
proper. Let n be large and ^„ denote the number of =5f-free subsets of {1, n}. Let 
e > 0, (5 as in Lemma 2.3 andn very large. Let be any fixed constant satisfying 

^ < ^ < c^-(,2-i) ' (4-^) 
and divide the J^-free sets into the following three categories : 

L Those sets A for which \An{^, \>S{^). 

n. Those A for which \A n ((/)n, ^^^] | < cSn{^ - 0). 
m. All other A. 



By following the argument in the proof of Lemma 4. 1 we can establish an upper bound 
for the number of sets of type I identical to the right-hand side of (4.6), just with 
TT replaced by ^. Regarding sets of type n, one immediately has these are at most 
O(2^^~^)"(^+'^^^))^0„) in number. One easily checks that the right-hand inequality of 
(4.7) implies that 

/(6,c)0+(^l-^^ <f(b,c). (4.8) 

Thus, in setting up a recursion for ^„ which will yield the theorem, sets of types I and 
n do not cause any problems. It remains to consider type-Ill sets, so let Cni,n denote 
their number. The definition of these sets clearly implies the existence of some largest 
t E {(f)n,^] such that the interval = [t — y/n,t] = [s.t] contains at least 5 elements 
of A. For each possible choice of such a t, let denote the corresponding family of
type-III sets. It suffices to show that

So consider a fixed t. Pick out a congruence class x (mod b) with the usual properties. 
For fixed x and J^^, put s := + si, t := ^^±^ + h, T := mm{t, ch}. Let 

G = (X, Y, E) be the following bipartite graph : X ^ {x e (^^^, + T] : x = 
X (mod b)},Y = [1, |-] and {x, y) E E if x + by — cz for some z e J^^. We decompose 
an $\mathcal{L}$-free set A as

A^{An{xu Y))u(^A n (p t]^ u(An {t, u(An (i^^^MV)) ■ 

(4.10) 

In the usual manner, Lemma 2.4 implies a bound for the first intersection of the form

(9(2'fc *^^+'^'(^/^^^). The definition of t implies that the possibilities for the third inter- 
section are negligible. We use a trivial bound for the fourth. For the second, it is now 
that we use the left-hand inequality of (4.7). This is easily checked to guarantee that 
T/b > t/c, and hence we can get a bound from Lemma 4.1. Putting these four bounds 
together and letting x and vary is then easily checked to yield (4.9). 
This completes the proof of Theorem 2.2 for the equations $\mathcal{L}_{1,b,c}$.
5. Proof in the remaining cases

The reason for presenting our arguments in the previous two sections in detail was
that those presented in [BHKLS] and [H] for establishing the sizes of the maximum
$\mathcal{L}$-free subsets of {1, ..., n} needed to be modified to a form more suitable for proving
Theorem 2.2. For the remaining equations, no such modifications are necessary and
things proceed smoothly.

5.1. The equation $\mathcal{L}_{1,1,3}$. That $\lceil n/2 \rceil$ is the maximum size of an $\mathcal{L}$-free subset of
{1, ..., n} is easily established by induction on n. One divides the $\mathcal{L}$-free sets into two
types, those that intersect $(n/3, 2n/3]$ and those that don't. For the latter type, the induction
is set up immediately. For the former, one chooses a $z \in A \cap (n/3, 2n/3]$ and pairs off
elements in the interval $[3z - n, n]$ using Lemma 3.1. Then the induction step can be
applied.
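
The lower-bound side of this statement and the pairing step can be checked numerically. The following Python sketch assumes that $\mathcal{L}_{1,1,3}$ denotes the equation x + y = 3z and that solutions with repeated values count; the function names and the values n = 21, z = 9 are hypothetical. It confirms that the odd numbers in {1, ..., n} contain no solution, and lists the pairs (x, 3z - x) of which a solution-free set containing z can hold at most one element each.

```python
def is_free_113(A):
    """True if A has no solution to x + y = 3z with x, y, z in A (repeats allowed)."""
    S = set(A)
    return not any(
        (x + y) % 3 == 0 and (x + y) // 3 in S
        for x in S for y in S
    )


def odd_numbers(n):
    """The odd integers in {1, ..., n}; there are ceil(n/2) of them."""
    return [m for m in range(1, n + 1) if m % 2 == 1]


def pairs_for(z, n):
    """Pairs (x, 3z - x) with 1 <= x <= 3z - x <= n."""
    return [(x, 3 * z - x) for x in range(1, n + 1) if x <= 3 * z - x <= n]


# Hypothetical values, chosen only for illustration.
n = 21
A = odd_numbers(n)
print(len(A), is_free_113(A))

z = 9
# Pairs with both members in A; each such pair would be a solution together with z.
blocked = [(x, y) for x, y in pairs_for(z, n) if x in A and y in A]
print(blocked)
```

The sketch only exhibits a solution-free set of size $\lceil n/2 \rceil$; it does not verify the upper bound, which is what the induction is for.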

To prove Theorem 2.2, one proceeds analogously. For given e, 5 one divides the =Sf- 
free sets into two types according as to whether \ A f] (|, |]| > or not. One then 
proceeds as in Section 3 to obtain a recursive estimate for the number ^„ of =Sf-free sets, 
but only one application of Lemma 2.4 will be necessary now. We omit further details. 

5.2. The equations $\mathcal{L}_{b,b,c}$. In a similar manner, the argument establishing the size of
a maximum $\mathcal{L}$-free subset of {1, ..., n}, as presented in [H], is already in such a form
that it can be directly modified to prove Theorem 2.2. Thus, there is nothing new going 
on here either. 

6. Concluding remarks 

The main purpose of this paper has been simply to propose Conjecture 1.1, as this 
does not seem to have been done in the existing literature. We have proven the weak 
form of it for some new equations, using well-established methods. Obviously, one 
would like to know if the strong form holds for these equations and, indeed, if one can 
obtain a more precise asymptotic for the number of $\mathcal{L}$-free subsets of {1, ..., n}. One
would also like to generalise our results to the group setting. Perhaps this can be done 
with only technical modifications of existing results for sum-free sets. 

It is nevertheless a more fundamental question as to whether even the weak form of
Conjecture 1.1 holds for any $\mathcal{L}$ whatsoever. I suspect that, if a counterexample exists,
then one can find one already in three variables. The intractability of the problem seems
to lie in the difficulty in computing $M_{\mathcal{L}}(n)$ in the first place for most $\mathcal{L}$. Indeed, what
our paper seems to show is that, in situations where $M_{\mathcal{L}}(n)$ can be computed by some
kind of pairing-off procedure (which seems most natural in three variables), something
like Lemma 2.4 gives (1.5) as a bonus almost 'for free'. One can wonder if there might
be a way to prove (1.5) without knowing $M_{\mathcal{L}}(n)$.